Using Naïve Text Queries for Robust Audio Information Retrieval
Abstract
The goal of this work is to build an audio information retrieval system that gives users flexibility in formulating their queries, from audio examples to naïve text. Specifically, this paper focuses on using naïve text to create input queries that describe the information users want. Naïve text queries, however, raise interoperability issues between the annotation and retrieval processes because of the wide variety of available audio descriptions. To solve these issues, we propose an intermediate audio description layer (iADL). The iADL comprises two axes, corresponding to semantic and onomatopoeic descriptions, and is based on human-to-human communication experiments on how people express sounds verbally. Various text modeling schemes, such as latent semantic analysis (LSA) and latent topic models, are used to map the naïve text onto the proposed iADL.
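To make the LSA step mentioned in the abstract more concrete, here is a minimal sketch of generic latent semantic analysis used to match a free-text query against text annotations of audio clips. It is not the authors' iADL mapping: the toy annotations, the function names (`lsa_scores`, `top_left_singular_vectors`), the choice of two latent dimensions, and the use of power iteration in place of a full SVD are all assumptions made for illustration.

```python
import math

def tokenize(text):
    return text.lower().split()

def term_doc_matrix(docs):
    """Return (vocab, A), where A[i][j] counts term i in document j."""
    vocab = sorted({w for d in docs for w in tokenize(d)})
    A = [[float(tokenize(d).count(w)) for d in docs] for w in vocab]
    return vocab, A

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def top_left_singular_vectors(A, k, iters=300):
    """Approximate the k leading left singular vectors of A by power
    iteration on A * A^T, orthogonalising against vectors already found."""
    n_terms, n_docs = len(A), len(A[0])
    basis = []
    for _ in range(k):
        u = [1.0] * n_terms
        for _ in range(iters):
            # w = A^T u (document space), then u = A w (term space)
            w = [sum(A[i][j] * u[i] for i in range(n_terms))
                 for j in range(n_docs)]
            u = [dot(A[i], w) for i in range(n_terms)]
            # Remove components along previously found vectors
            for b in basis:
                c = dot(u, b)
                u = [ui - c * bi for ui, bi in zip(u, b)]
            norm = math.sqrt(dot(u, u))
            u = [ui / norm for ui in u]
        basis.append(u)
    return basis

def project(basis, vec):
    """Coordinates of a term-space vector in the latent space."""
    return [dot(b, vec) for b in basis]

def cosine(x, y):
    nx, ny = math.sqrt(dot(x, x)), math.sqrt(dot(y, y))
    return dot(x, y) / (nx * ny) if nx > 0 and ny > 0 else 0.0

def lsa_scores(docs, query, k=2):
    """Cosine similarity between the query and each document in a
    k-dimensional latent space; unknown query words are ignored."""
    vocab, A = term_doc_matrix(docs)
    basis = top_left_singular_vectors(A, k)
    index = {w: i for i, w in enumerate(vocab)}
    q = [0.0] * len(vocab)
    for w in tokenize(query):
        if w in index:
            q[index[w]] += 1.0
    q_hat = project(basis, q)
    scores = []
    for j in range(len(docs)):
        column = [A[i][j] for i in range(len(vocab))]
        scores.append(cosine(q_hat, project(basis, column)))
    return scores

# Hypothetical text annotations of five audio clips (illustration only)
docs = [
    "dog bark loud",
    "dog woof bark",
    "puppy woof",
    "rain drip splash",
    "water drip",
]
scores = lsa_scores(docs, "woof")
```

In this toy setup the onomatopoeic query "woof" scores highly against the first annotation even though that annotation never contains the word, because the two are linked through co-occurring terms in the latent space. The query and the annotations are projected with the same basis, which is one common fold-in convention; other LSA variants also rescale by the singular values.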
Similar resources
A system for spoken query information retrieval on mobile devices
We present a system which allows the user to search for information on mobile devices using spoken natural language queries. This is the first work that we are aware of which evaluates spoken query based information retrieval on a commonly available and well researched text database, the Chinese news corpus used in National Institute of Standards and Technology (NIST)’s TREC-5 and TREC-6 confer...
A new term-weighting scheme for naïve Bayes text categorization
Purpose – Automatic text categorization has applications in several domains, for example e-mail spam detection, sexual content filtering, directory maintenance, and focused crawling, among others. Most information retrieval systems contain several components which use text categorization methods. One of the first text categorization methods was designed using a naïve Bayes representation of the...
Mandarin-English Information (MEI)
Mandarin-English Information (MEI) is one of the four projects selected for the Johns Hopkins University Summer Workshop 2000. We plan to develop technologies for using written queries to search spoken documents (cross-media) between English and Mandarin Chinese (cross-language). Our research focus is on the integration of speech recognition and machine translation technologies in the context o...
Modeling music and words using a multi-class naïve Bayes approach
We propose a query-by-text system for modeling a heterogeneous data set of music and words. We quantitatively show that our system can both annotate a novel song with semantically meaningful words and retrieve relevant unlabeled songs from a database given a text-based query. We explain two feature extraction methods useful for summarizing the audio content of a song. We describe a supervised m...